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DETAILED ACTION 

The Office Action of 21 July 2007 directed to the examination of non-elected SEQ ID 
NO: 1 is hereby VACATED. 

Restriction / Election 

1. Applicant's election with traverse of Applicant elects Group I, claims 1-14, 20-28, 
and SEQ ID NO: 2 only, in the reply filed on 3/16/07 is acknowledged. The traversal is on the 
ground(s) that '\ . .not seen as . . .undue burden . . This is not found persuasive because while a 
search of the prior art for one group may overlap with that of another group, they are not co- 
extensive of each other and thus would represent undue burden on Office resources. 

2. Claims 15-16, 18-19, 29-33 and sequences other than SEQ ID NO:. 2 are withdrawn 
from further consideration pursuant to 37 CFR 1 .142(b), as being drawn to a nonelected 
inventions, there being no allowable generic or linking claim. Applicant timely traversed the 
restriction (election) requirement in the reply filed on 3/16/07. 

3. Claims 1-14, 20-28 are examined in the instant application. 

4. The requirement is deemed proper and is therefore made FINAL. 

Claim Objections 

5. Claim 1 is objected to because of the following informalities: the claim recites non- 
elected SEQ ID NOs. Appropriate correction is required. 

6. Claims 9, 21 are objected to because of the following informalities: the claim recites 
the . . host organism ... is ... a plant cell." Plant cells are not organisms. Appropriate 
correction is required. 
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7. Claim 1 is objected to because of the following informalities: the claim recites an 
improper Markush group drawn to 2 members of a genus and a separate genus. Appropriate 
correction is required. 

Claim Rejections - 35 U.S.C. §112, second paragraph 

The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

8. Claims 1-14 and 20-28 rejected under 35 U.S.C. §112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which the 
applicant regards as the invention. 

9. In Claims 1-14 and 20-28, it is unclear what is being retained in the derived product. 

Claim Rejections - 35 USC § 112, r\ paragraph, written description 
The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

10. Claims 1-14 and 20-28 are rejected under 35 U.S.C. 1 12, first paragraph, as failing to 
comply with the written description requirement. The claim(s) contains subject matter which 
was not described in the specification in such a way as to reasonably convey to one skilled in the 
relevant art that the inventor(s), at the time the application was filed, had possession of the 
claimed invention. 
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The claims are broadly drawn to an inducible promoter, SEQ ID NO: 2 (NlMFN-l 
promoter), biologically active derivatives of SEQ ID NO: 2 of any length and sequence from any 
source, "chemicals" and "organic compounds" capable of inducing SEQ ID NO: 2, and 
undescribed heterologous nucleic acid sequences encoding wild-type or mutant NIMIN-1 
promoters. Said sequences include genes encoding promoters from any source (Claim 1-14, 20- 
28), as well as any sequence from any source encoding any promoter which has any biological 
activity of any kind, including irreversible self-excision from the genome. 

In contrast, Applicant has only described SEQ ID NO: 2 (sequence listing), GUS expression 
data with NIMIN-2 (SEQ ID NO: 2)-GUS fusions (Examples 2-3), and transgenic tobacco plants 
containing SEQ ID NO: 2-GUS constructs (Examples 2-3). 

Applicant does not describe biologically active derivatives of SEQ ID NO: 2 of any 
length and sequence from any source, any "chemicals" (e.g., mercury or water) or any "organic 
compounds"' (e.g., cyanide or glyphosate) capable of inducing SEQ ID NO: 2, and undescribed 
heterologous nucleic acid sequences encoding wild-type or mutant SEQ ID NO: 2 promoters. 
Said sequences include genes encoding promoters from any source (Claim 1-14, 20-28), as well 
as any sequence from any source encoding any promoter with any or no promoter activity. 

Applicant has not described the structure or any other relevant characteristics for all 
nucleic acid sequences encoding biologically active derivatives of NIMIN-2 promoters, or a 
representative number of same and a literature review does not indicate that they are well known 
to one of skilled in the art. Applicant has only described nucleic acid sequences encoding SEQ 
ID NO: 2 from Arabidopsis, 
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Applicant fails to describe a representative number of biologically active derivatives of 
SEO ID NO: 2, or chemicals or organic compounds capable of inducing said biologically active 
derivatives. Applicant only describes SEQ ID NO: 2. Furthermore, Applicant fails to describe 
structural features common to members of the claimed genus of biologically active derivatives of 
SEQ ID NO: 2. Hence, Applicant fails to meet either prong of the two-prong test set forth by 
Eli Lilly, Furthermore, given the lack of description of the necessary elements essential for 
NIMlN-2 promoter activity, it remains unclear what features identify NIMIN-2 promoters. Since 
the genus of NlMIN-2 promoters or the genus of biologically active fragments thereof has not 
been described by either specific structural features or a representative number of species, the 
specification fails to provide an adequate written description to support the breath of the claims. 

The Federal Circuit has clarified the application of the written description requirement. 
The court stated lhat a written description of an invention "requires a precise definition, such as 
by structure, formula, [or] chemical name, of the claimed subject matter sufficient to distinguish 
it from other materials." University of California v. Eli Lilly and Co,, 1 19 F.3d 1559, 1568. The 
court also concluded that ''naming a type of material generally known to exist, in the absence of 
knowledge as to what that material consists of, is not a description of that material." Eli Lilly, 
Further, the court held that to adequately describe a claimed genus, Patent Owner must describe 
a representative number of the species of the claimed genus, and that one of skill in the art 
should be able to ''visualize or recognize the identity of the members of the genus." Jd. 

Finally, the court held: 

A description of a genus of cDNAs may be achieved by means of a 
recitation of a representative number of cDNAs, defined by nucleotide sequence, 
falling within the scope of the genus or a recitation of structural features common 
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to members of the genus, which features constitute a substantial portion of the 
genus. Id, 

See also MPEP Section 2163, page 174 of Chapter 2100 of the August 2005 version, 
column 1, bottom paragraph, where it is taught that 

|T]he claimed invention as a whole may not be adequately described 
where an invention is described solely in terms of a method of its making coupled 
with its function and there is no described or art-recognized correlation or 
relationship between the structure of the invention and its function. A 
biomolecule sequence described only by a functional characteristic, without any 
known or disclosed correlation between that function and the structure of the 
sequence, normally is not a sufficient identifying characteristic for written 
description purposes, even when accompanied by a method of obtaining the 
claimed sequence. 

See also Amgen Inc. v. Chugai Pharmaceutical Co. Ltd., 18 USPQ 2d 1016 at 1021, 
(Ted. Cir. 1991) where it is taught that a gene is not reduced to practice until the inventor can 
define it by "its physical or chemical properties" (e.g. a DNA sequence). 

Given the claim breadth and lack of guidance as discussed above, the specification fails 
to provide an adequate written description of the genus of sequences as broadly claimed. Given 
the lack of written description of the claimed genus of sequences, any method of using them, 
such as transforming plant cells and plants therewith, and the resultant products including the 
claimed transformed plant cells and plants containing the genus of sequences, would also be 
inadequately described. Accordingly, one skilled in the art would not have recognized Applicant 
to have been in possession of the claimed invention at the time of filing. See The Written 
Description Requirement guidelines published in Federal Register/ Vol. 66, No. 4/ Friday 
January 5, 2001/ Notices: pp. 1099-1111. 
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Claim Rejections - 35 U.S.C, §112^ first paragraph, enablement 

The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The speci 11 cation shall contain a written description of the invention, and of the manner and process of making 
and using it. in such fulL clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by .the inventor of carrying out his invention. 

11.. Claims 1-14 and 20-28 are rejected under 35 U»S.C. 1 12, first paragraph, because the 

specification, while being enabling for SEQ ID NO: 2, does not reasonably provide enablement 

for biologically active derivatives of SEQ ID NO: 2. The specification does not enable any 

person skilled in the art to which it pertains, or with which it is most nearly connected, to make 

and use the invention commensurate in scope ^yith these claims. 

The Wands court set forth the enablement balancing test: 

Factors to be considered in determining whether a disclosure meets the 
enablement requirement of 35 USC 112, first paragraph, have been described by 
the court in In re Wands, 858 F.2d 731, 8 USPQ2d 1400, 1404 (Fed. Cir. 1988). 
Wands states at page 1404, "Factors to be considered in determining whether a 
disclosure would require undue experimentation have been summarized by the 
board in Ex parte Forman. They include (1) the quantity of experimentation 
necessary, (2) the amount of direction or guidance presented, (3) the presence or 
absence of working examples, (4) the nature of the invention, (5) the state of the 
prior art, (6) the relative skill of those in the art, (7) the predictability or 
unpredictability of the art, and (8) the breadth of the 'claims." 

M.P.E.P.§ 2164.01(a), 

The claims are broadly drawn to an inducible promoter, SEQ ID NO: 2 (NIMIN-2 

promoter), biologically active derivatives of SEQ ID NO: 2 of any length and sequence from any 

source, "chemicals" and "organic compounds" capable of inducing SEQ ID NO: 2, and 

undescribed heterologous nucleic acid sequences encoding wild-type or mutant NIMrN-2 

promoters. Said sequences include genes encoding promoters from any source (Claim 1-14, 20- 
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28), as well as any sequence from any source encoding any promoter which has any biological 
activity of any kind, including irreversible self-excision from the genome. 

In contrast. Applicant has only teaches SEQ ID NO: 2(sequence listing), GUS expression 
data with NIMlN-29 (SEQ ID NO: 2)-GUS fusions (Examples 2-3), and transgenic plants 
containing SEQ ID NO: 2-GUS constructs(Examples 2-3). 

Applicant do not teach biologically active derivatives of SEQ ID NO: 2 of any length 
and sequence from any source, "chemicals" (e.g., mercury or water) or "organic compounds" 
(e.g., cyanide or glyphosate) capable of inducing SEQ ID NO: 2, and undescribed heterologous 
nucleic acid sequences encoding wild-type or mutant SEQ ID NO: 2 promoters. Said sequences 
include genes encoding promoters from any source (Claim 1-14, 20-28), as well as any sequence 
from any source encoding any promoter which somehow "modulates" expression or activity of a 
gene of interest. 

The state-of-the-art is such that one of skill in the art cannot predict which "biologically 
active derivatives of SEQ ID NO: 2" will work a priori. Reviews by Kim, Hannenhalli, Maiti, 
and Doelling detail a variety of problems seen in promoter identification and modification. 

The state of the prior art, as exemplified by Kim et al (Plant Molecular Biology, vol. 24, 
pp. 1 05- 1 1 7, 1 994)) teaches the extreme sensitivity of promoter regions to single base pair 
changes, the absolute requirement for as few as 3 to 6 nucleotides for promoter function, and the 
failure of a prompter to function either constitutively or specifically when lacking 
oligonucleotide regions approximately 100 bp upstream of the transcription start site (page 106, 
paragraph bridging the columns; paragraph bridging pages 107 and 108; page 110, paragraph 
bridging the columns). In addition, the claimed nucleic acid sequence that is a biologically active 
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derivatives of SEQ ID NO: 2 would comprise non- functional transcriptional and translational 
elements, i.e. modifications of CAAT, TATA and the ATG codon, required for proper initiation 
of these cellular activities, known in the prior art; as well as highly conserved promoter regions 
rendered inactive by modifications. In addition, Applicant has not shown that any biologically 
active derivative of SEQ ID NO: 2 can also have the desired promoter activity. 

A recent review by Hannenhalli (2001) Bioinformatics 17: S90-S96) teaches that 
prediction of eukaryotic promoters has been one of the most elusive problems despite 
considerable eflbrl devoted to the study. {See the abstract at least). In the instant case of a 
prompter, Hannenhalli's review teaches a 50% failure in the sensitivity of promoter detection (p. 
S90, last full sentence). 

Twenty base-pair long regions of a DNA fragment that has promoter acfivity cannot 
predictably be assumed to also have promoter activity. Deletion analysis of various promoters 
have shown that even DNA segments from the portion of a promoter region containing sequence 
elements thought to be most important {e,g., the TATA-box) need to be longer than 20 basepairs. 
Maiti el al, in studies on a figwort mosaic virus promoter, found that the smallest portion 
upstream of the transcriptional start site that would support transcription was 198 basepairs long; 
segments of 73 and 37 basepairs did not work (1997, Transgen. Res., 6:143-156, see Fig. 4). 
Doelling et al found that the minimal rRNA promoter of Arabidopsis thaliam is at least 33 
nucleotides long (1995, Plant J. 8:683-692, see Fig. 1). 

The specification, while suggesting the use of the SEQ ID NO: 2, did not provide 
significant guidance on how to overcome art recognized problems in achieving expression of 
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promoter smaller "biologically active derivatives of SEQ ID NO: 2" of DNA, including single 
base pair polynucleotide promoters while still retaining activity. 

In addition, since the working examples disclosed in the specification are limited to 
unmodified SEQ ID NO: 2, the chemically inducible activity of sequences cannot be 
extrapolated to any derivatives thereof, absent specific guidance. While Applicant is not required 
to exemplify each and every claimed embodiment, specific guidance as to which region of the 
disclosed sequences can be modified, truncated so that the s chemically inducible activity 
promoter activity is retained is required. 

The specification has no working examples of sequences other than SEQ ID NO: 2 and 
no working examples of any "biologically active derivatives of SEQ ID NO: 2". 

Without sufficient guidance, identification of NIMIN-2 promoters is unpredictable and 
without guidance on how to overcome the problems seen in determining a priori which 
sequences will have NIMIN-2 promoter activity transgenic plants, it is unpredictable and the 
experimentation left to those skilled in the art is unnecessarily and improperly extensive and 
undue. 

Therefore, given the breadth of the claims; the lack of guidance and working examples; 
the unpredictability in the art; and the state-of-the-art as discussed above, undue trial and error 
experimentation would be required to practice the claimed invention, and therefore the invention 
is not enabled throughout the broad scope of the claims. 



Claim Rejections - 35 U.S.C. §102 
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The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

fe) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United Slates before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

35 U.S.C, §102. 

12. Claims 1-5 are rejected under 35 U.S.C. 102(b) as being anticipated by Sato,S., 
(2000) Structural analysis of Arabidopsis thaliana chromosome 3. 1. Sequence features of the 
regions of 4,504,864 bp covered by sixty PI and TAC clones. J, DNA Res. 7 (2), 131-135. {See 
Appendix A). Sato discloses a recombinant nucleic acid containing at least a first nucleotide 
sequence operably linked to at least a second nucleotide sequence containing a transgene to be 
expressed, wherein the first nucleotide sequence contains a regulatory sequence selected from 
the group consisting of SEQ-ID-No. 2, and a biologically active derivative thereof, wherein the 
regulatory sequence is a promoter sequence selectively inducible by chemicals, wherein the 
chemicals are selected from the group consisting of organic compounds, wherein the organic 
compounds are selected from the group consisting of phenolic compounds, thiamine, benzoic 
acid, isonicotinic acid (INA), and derivatives thereof, wherein the phenolic compound is salicylic 
acid or a structural or functional derivative thereof, wherein the expression/transcription of said 
nucleotide sequence results in a detectable signal, a vector containing the recombinant nucleic 
acid according, and a bacterial host cell. {Id, p. 131, left column, 1^^ paragraph). Because 
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Sato discloses cloning host cells with cloning vectors, Sato discloses the limitations of the 
claimed invention. Thus, the reference discloses all the limitations of the Claimed invention. 

13. Claim 1-14, 20-28 rejected under 35 U.S.C. 102(e) as being anticipated by Cade et al 
(PGPPUB US-2004/0 154051, published 5 August 2004, filed 12 March 2004, See Appendix B). 

Cade discloses a recombinant nucleic acid containing a first nucleotide sequence operably 
linked to a second nucleotide sequence containing a transgene to be expressed, wherein the first 
nucleotide sequence contains a biologically active derivative of SEQ-ID-No. 2, wherein the 
regulatory sequence is a promoter sequence selectively inducible by salicylic acid, wherein the 
recombinant nucleic acid further contains a GUS reporter system which comprises at least one 
nucleotide sequence, wherein the expression/transcription of said nucleotide sequence results in a 
detectable signal, a bacterial or plant cell host organism containing a vector containing the 
recombinant nucleic acid, transgenic plant wherein said recombinant nucleic acid is stably 

r 

integrated, wherein the transgene contained in the second nucleotide sequence is transiently 
expressed, and wherein the expression of the transgene contained in the second nucleotide 
sequence is selectively induced upon treatment salicylic acid. (See examples 18, 19 and Claim 
11). 

Cade also discloses a bacterial or plant cell host organism containing said vector containing 
said polynucleotides encoding said biologically active fragment and said reporter system, 
wherein the transgene contained in the second nucleotide sequence is transiently expressed and 
selectively induced upon treatment with salicylic acid, said recombinant nucleic acid further 
containing a reporter system which comprises at least one nucleotide sequence, and wherein the 
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expression/transcription of said nucleotide sequence results in a detectable signal. (See examples 
18, 19 and Claim 11). 

Thus, the reference discloses all the limitations of the Claimed invention. 



14. No Claim is allowed. 



Any inquiry concerning this communication or earlier communications from the examiner should be directed to 
Brendan O. Baggot whose telephone number is 571/272-5265. The examiner can normally be reached on Tuesday through 
Thursday. 10:00 AM to 4:00 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, Anne Marie Grunberg can 
be reached on 571/272-0975. The fax phone number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the Patent Application Information Retrieval 
<PAIR) system. Status information for published applications may be obtained from either Private PAIR or Public PAIR. Status 
information for unpublished applications is available through Private PAIR only. For more information about the PAIR system, 
see hitp://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR system, contact the Electronic Business 
Center (BBC) at 866-217-9197 (toll-free). 



DAVID H.KRUSE,t>H.D. 
PRIMARY EXAMINER 
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APPENDIX A 

SCORE Search Results Details for Application 10566201 and Search Result us-10-566-201- 
2 . rge . 

Score Home Page Retrieve Application List SCORE System Overview SCORE FAQ Comments 
/ Suggestions 

This page gives you Search Results detail for the Application 10566201 and Search 
Result us-10-566-201-2 . rge. 



Go Back to previous page 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



September '28 , 2006, 01:51:24 ; Search time 11528 Seconds 

(without alignments) 
6800.793 Million cell updates/sec 



Title : 



Sequence : 



US-10-566-201-2 



Perfect score: 1226 



3 gatctctatgtatataaaaa ttgactaagcttaaacgacg 1226 



Scoring table : 1DENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



6366136 seqs, 31973710525 residues 



Total number of hits satisfying chosen parameters: 



12732272 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



GenEmbl : * 

1 : gb_env : * 

2 : gb_pa C : * 

3 : gb_ph : * 

4.: gb_pl:* 

5 : gb_pr : * 

6 : gb_ro : * 

7 : gb_sts : * 

8: gb_sy;* 

9 : gb_un : * 

1 0 : gb_vi : * 
1 1 : gb_OV : * 
12: gb_htg:* 
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13: gb_in:* 
14 : gb_om : * 
15: gb_ba:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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4 


AC005142 


AC005142 Arabidops 


c 


13 


6 91 


56 


4 


159629 


4 


ATCHRIV9 


AL1614 97 Arabidops 


c 


14 


553 . 2 


45 


1 


95190 


4 


AC007203 


AC007203 Arabidops 




15 


231.8 


18 


9 


105223 


4 


AC007399 


AC007399 Arabidops 


c 


16 


114 . 6 


9 


3 


349980 


2 


AX344555 


AX344555 Sequence 




17 


112 


9 


1 


175544 


12 


AC117342 


AC117342 Rattus no 




18 


1 .1 1 . 2 


9 


1 


4660 


2 


CS083843 


CS083843 Sequence 




19 


111 . 2 


9 


1 


90550 


5 


AL592166 


AL592166 Human DNA 


c 


20 


108 . 8 


8 


9 


109786 


4 


F5K24 


AF128395 Arabidops 


c 


21 


108 .8 


8 


9 


183181 


4 


ATCHRIV19 


AL161507 Arabidops 




22 


107.6 


8 


8 


1524 


2 


CSO'83838 


CS083838 Sequence 




23 


107.6 


8 


8 


212999 


12 


AC151201 


AC151201 Bos tauru 


c: 


24 


10 7.4 


8. 


8 


170627 


12 


AC125567 


AC125567 Rattus no 


c 


25 


106 . 4 


8 


7 


15548 


2 


AX347057 


AX34 70 5 7 Sequence 




26 


105 . 8 


8 


6 


47403 


2 


AX059535 


AX059535 Sequence 




27 


105 .8 


' 8 


6 


91470 


4 


T4B21 


AF118223 Arabidops 




28 


105,8 


8 


6 


110000 


2 


AR777056_04 


Continuation (5 of 




29 


105.8 


8 


6 


■200001 


4 


ATCHRIV13 


AL161501 Arabidops - 




30 


105 , 6 


8 


6 


154563 


5 


CR936360 


CR9363 60 Human DNA 


c 


31 


104 , 4 


8 


5 


2131 


2 


CS083950 


CS083950 Sequence 


c 


32 


104 .4 


8 


5 


172816 


5 


AC093899 


ACO 938 99 Homo sapi 




33 


104,2 


8 


5 


241619 


12 


AC167751 


AC167751 Bos tauru 


c 


34 


103 . 8 


8 


5 


166501 


12 


CR548626 


CR548626 Danio rer 


c 


35 


103 . 2 


8 


4 


7218 


2 


166494 


166494 Sequence- 14 


c 


36 


101 . 6 


8 


3 


143331 


5 


AC091214 


AC091214 Homo sapi 


c 


37 


100 . 8 


8 


2 


169510 


12 


. CR855864 


CR855864 Danio rer 


c 


38 


ino . 8 


8 


2 


178273 


12 


AC005308 


AC005308 Plasmodiu 




39 


100.8 


8 


2 


250531 


13 


AE014845 


AE014845 Plasmodiu 




40 


100 . 6 


8 


2 


155942 


12 


AC136691 


AC136691 Homo sapi 




41 


100 . 6 


8 


2. 


161765 


5 


AC113190 


AC113190 Homo sapi 




42 


100 . 6 


8 


2 


176479 


12 


AC135631 


AC135631 Homo sapi 


c 


43 


100 . 2 


8 


2 


192265 


5 


CNS018P3 


AL110118 Human chr 




44 


100 


8 


2 


154604 


11 


AL954739 


AL954739 Zebrafish 
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45 



99,8 8.1 67970 13 PFMAL1P3 



AL031746 Plasmodiu 



ALIGNMENTS 



RESULT 3 
AB023041/C 

I.OCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



AB023041 • 83650 bp DNA ' linear PLN 14-FEB-2004 

Arabidopsis thaliana genomic DNA, chromosome 3, PI clone: MPEll.- 
AB023041 BA000014 
AB02 3041 . 1 GI : 4 22064 0 

Arabidopsis thaliana (thale cress) 
Arabidopsis thaliana 

Eukaryota; Viridiplantae ; Streptophyta ; Embryophyta; Tracheophyta"; 
Spermatophyta; Magnoliophyta ; eudicotyledons ; core eudicotyledons ; 
rosids; eurosids II; Brassicales; Brassicaceae ; Arabidopsis. 

1 

Sat o, S , , Nakamura,Y., Kaneko,T,, Katoh,T., Asamizu,E. and Tabata,S. 

Structural analysis of Arabidopsis thaliana chromosome 3.. I, 
Sequence features of the regions of 4,504,864 bp covered by sixty 
PI and TAC clones 
DNA Res. 7 (2), 131-135 (2000) 

10819329 

2 (bases 1 to 83650} 

Sato,S., Nakamura,Y., Kaneko,T., Kato,T., Asamizu,E. and Tabata,S. 
Direct Submission 

Submitted { 01 - FEB- 1999 ) Yasukazu Nakamura, Kazusa DNA Research 
Institute, Department of Plant Gene Research; 1532-3, Yana, 
Kisarazu, Chiba 292-0812, Japan (E-mail : ynakamu®ka2usa . or . jp, 
Tel : 81-438-52-3 93 5, Fax:81-438-52-3934) 
Address for correspondence: kaos@kazusa.or.jp 

For the latest information on annotation of this clone, please see 
http : //www . kazusa , or , j p/kaos/cgi -bin/agd_graph . cgi?c=MPEll 
Genes with similarity to proteins in the databases are described in 
•product' or 'note' qualifiers. Genes that have no significant 
protein similarity are described as 'unknown protein'. 
The software programs used to predict genes include: Grail 
(Informatics Group, Oak Ridge National Laboratory, 
http : //compbio . ornl .gov/Grail-1 . 3/) , 

GENSCAN (Chris Burge, MIT, http://CCR-081.mit.edu/GENSCAN.html), 
NetGene2 (S.M. Hebsgaard, et al . , CBS, Technical University of 

Denmark, http://www.cbs.dtu.dk/services/NetGene2/) and 

Spl icePredictor (Volker Brendel, Stanford University, 

http: //greml inl , zool . iastate.edu/cgi-bin/sp.cgi) . 

Genes encoding tRNAs are predicted by tRNAscan-SE 

(Sean Eddy, Washington University School of Medicine, St. Louis, 

http : //genome . wustl . edu/eddy/ tRNAscan-SE/ ) . 

This sequence may not be the entire insert of this clone. It may be 
shorter because we remove overlaps between neighboring submissions. 
The 5' clone is K9I22 and the 3' clone is MJL14 . 

Location/Qualifiers 

1. .83650 

/organism= "Arabidopsis thaliana" 
/mol_type= "genomic DNA" 
/db__xref ="taxon: 3 702" 
/chromosome= " 3 " 
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/clone="MPEll" 
/clone_lib=: "Mitsui PI" 
/ecotype= "Columbia " 
exon complement ( 54 2 . .764) 

/inference= "non-experimental evidence, no additional 
details recorded" 

/note="CDS is reported in Acc# AP000599 . 

contains similarity to CHP-rich zinc finger protein 

gene_id: K9I22 .5" 

/number=l 

CDS join{1705. .2463,2548. .2739.2862. .2949,3037. .3107, 

3190. .3273,3410. .3514) 

/ihference= "non-experimental evidence, no additional 

details recorded" 

/note="gb| AAD5 513 9. 1 

gene_id:MPEll . 1" 

/codon_start=l 

/product= "dihydrolipoamide S-acetyltransf erase " 

/protein_id="BAB0104 7 . 1" 
/db_xref ="GI : 9279589" 

/translation="MTVRSKIREIFMPALSSTMTEGKIVSWIKTEGEKLAKGESWVV 
ESDKADMDVETFYDGYLAAIWGEGETAPVGAAIGLLAETEAEIEEAKSKAASKSSSS 

VAEAWPSPPPVTSSPAPAIAQPAPVTAVSDGPRKTVATPYAKKLAKQHKVDIESVAG 
TGPFGRITASDVETAAGIAPSKSSIAPPPPPPPPVTAKATTTNLPPLLPDSSIVPFTA 
MQSAVSKNMIESLSVPTFRVGYPVNTDALDALYEKVKPKGVTMTALLAKAAGMALAQH 
PWNASCKDGKSFSYNSSINIAVAVAINGGLITPVLQDADKLDLYLLSQKWKELVGKA 
RSKQLQPHEYNSGTFTLSNLGMFGVDRFDAILPPGQGAIMAVGASKPTWADKDGFFS 
VKNTMLVNVTADHRIVYGADLAAFLQTFAKIIENPDSLTL" 
CDS complement (4594 . .5106) 

/inference= "non-experimental evidence, no additional 
details recorded" 

/note= "unnamed protein product; gene_id : MPEll . 2 

unknown protein" 

/codon_start=l 

/protein_id="BAB01048 . 1" 

/db_xref="GI: 9279590" 

/ 1 rans 1 a t i on= " MEDLLEERLTRTDSVGNKRVRDGLDLDS PDVKRLRDDLFDDSGL 

DPVSQDLDSVMKSFENELSTTTTU^LSSGETQPDLGYLFEASDDELGLPPPLTPPQTLL 
PPSCEETVTELVRASSDSSEVGELCGFEDHVTEFGPCDLGDDGLFEYFDGCLDSGDLF 
SWRPEFLPAE" 

CDS complement (join(10090. .104 08,1064 9. .10804,10916. .10962)} 

/inference= "non-experimental evidence, no additional 

details recorded" 

/note= "unnamed protein product; gene_id : MPEll . 3 

similar to unknown protein 

sp|P42744" 

/codon_start=l 

/protein_id= "BABO 104 9-. 1 " 

/db_xref ="GI : 9279591" 

/translation="MMEPKAKYDRQLMYTIQGTLEEASICLLNCGPIGSNALKNLVLG 
GVGSITIVEGSKVLIGDIWKQFHRHAIEQKFSISEGFRDENNTVFQRREQHSVFQRQL 
EQNRIAGQTVRPMDIARRIWASIGRMWSLADRTTTYGKEARAITDPPFGRVLARLTSL 
VDRHLDFRRLCDI" 
CDS complement (11450 . .11818) 

/ in fe rence=: " non-experimental evidence, no additional 
details recorded" 

/note- "unnamed protein product; gene_id : MPEll . 4 
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Art Unit: 1638 

unknown protein" 
/codon_start=l 
/protein_id= "BAB01050 . 1" 
/db_xref ="GI : 9279592" 

/trans la tion= "MNNSLKKEERVEEDNGKSDGNRGKPSTEWRTVTEEEVDEFFKI 
LRRVHVATRTVAKVNGGVAEGELPSKKRKRSQNLGLRNSLDCNGVRDGEFDEINRVGL 
QGLGLDLNCKPEPDSVSLSL" 
CDS 16981. .17979 

/inference= "non-experimental evidence, no additional 
details recorded" 
/note= "gene_id:MPEll . 5 " 
/codon_start=l 

/product="AP2 domain transcription factor-like protein" 
/protein_id="BAB01051 . 1" 
/db_xref ="GI : 92 7 9593*' 

/translation="MAERKKRSSIQTNKPNKKPMKKKPFQLNHLPGLSEDLKTMRKLR 
FWNDPYATDYSSSEEEERSQRRKRYVCEIDLPFAQAATQAESESSYCQESNNNGVSK 

TKISACSKKVLRSKASPWGRSSTTVSKPVGVRQRKWGKWAAEIRHPITKVRTWLGTY 
ETLEQAADAYATKKLEFDALAAATSAASSVLSNESGSMISASGSSIDLDKKLVDSTLD 
QOAGESKKASFDFDFADLQIPEMGCFIDDSFIPNACELDFLLTEENNNQMLDDYCGID 
DLDIIGIjECDGPSELPDYDFSDVEIDLGLIGTTIDKYAFVDHIATTTPTPLNIACP" 
CDS join.(21893. .22063,22330. .223 86,22578. .22783,228 94. .23227, 

23311. .23382,23473. .23613) 

/ inference= non- experimental evidence, no additional 

details recorded" 

/note="gb| AAF23821 . 1 

gene_id:MPEll .6" 

/codon_start=l 

/product= "homocysteine S-methyltransf erase AtHMT-1" 
/protein_id="BAB010 52 . 1" 
/db_xref="GI : 9279594" 

/translation="MVLEKKSALLEDLIKKCGGCAWDGGFATQLEIHGAAINDPLWS 
AVSLIKNPELIKRVHMEYLEAGADIWTSSYQATIPGFLSRGLSIEESESLLQKSVEL 
AVEARDRFWEKVSKVSGHSYNRALVAASIGSYGAYLADGSEYSGHYGENVSLDKLKDF 
HRRRLQVLVEAGPDLLAFETIPNKLEAQACVELLEEEKVQIPAWICFTSVDGEKAPSG 
ESFEECLEPLNKSNNIYAVGINCAPPQFIENLIRKFAKLTKKAIWYPNSGEVWDGKA 
KQWLPSQCFGDDEFEMFATKWRDLGAKLIGGCCRTTPSTINAISRDLKRR" 
CDS complement (27277 . .27552) 

/inference="non-experimental evidence, no additional 
details recorded" 

/note= "unnamed protein product; gene_id : MPEll . 7 

unknown protein" 

/codon_start=l 

/protein_id="BAB01053 .1" 

/db_xref="GI: 9279595" 

/translation= "MTHAREWRSSLTTTLLMVILLSYMLHLFCVYSRVGAIRIFPETP 
ASGKRQEEDLMKKYFGAGKFPPVDSFVGKGISESKRIVPSCPDPLHN" 
CDS 30765. .31883 

/inference= "non-experimental evidence, no additional 
details recorded" 

/note="unnamed protein product; gb | AAF16548 . 1 

gene_id : MPEll . 8 

similar to unknown protein" 

/ codon_start= 1 
/protein_id= "BAB010 54 . 1" 
/db_xref ="GI : 92795 96 

/translations "MPKERKERSVSLDKYKRSPLCCEASLALKPSEKQVKEWEEARCP 
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VCMEHPHNGILLICSSYENGCRPYMCDTSHRHSNCFDQFRKASKEKPSLSLLREEEES 
NEPTEMEDVDSDSTAVNLLGEAASEITWDLSDGERGEEEVEEEEEEVWEEEEEGIV 
TTEEDQEKNKPQKLTCPLCRGHIKEWVWKAARCFMNSKHRSCSCETCDFSGSYSDLR 
KHARLLHPGVRPSEADPERQRSWRRLERQSDLGDLLSTLQSSFGGDEISNDDGFLFAD 
TRLLTVYFLIRVFRPESSGSRSSSWSGTSRARTHTSGRRRSSRPASLWGESYEGNTGT 
SPRDEENNQSSDEQySGTRRRRSRRRTVIDDDDEEEEP" 
CDS complement (join (32197. .32430,32525. .32676,32969. .33332, 

33404. .33487)) 

/inf erence="non-experimental evidence, no additional 
details recorded" 
/note="gene_id: MP.Ell . 9" 
/codon_start=l 

/product^ "chloroplast SOS ribosomal protein LIS" 
/protein_id="BAB01055 . 1" 
/db_xref ="GI : 92795 9 7" 

/translation^ "MATPLSISSNPLTSRHCYRLHLSSTSFKGNVSVLGANPSQILSL 
KLNQTLKTRNQQQFARPLVWSQTAATSSAWAPERPRLDNLGPQPGSRKKQKRKGRG 
ISAGQGASCGFGMRGQKSRSGPGIMRGFEGGQTALYRRLPKLRGIAGGMRSGLPKYLP 
VNIKDIETAGFQEGDEVSLETLKQKGLINPSGRERKLPLKILGTGELSMKLTFKARAF 
STQAKEKLEASGCTLTVLPGRKKWVKPSVAKNQARADEYFAKKRAAAAEAATSEPAAS 
A" 

CDS complement (join (33961. .34021,34114. .34226,34391. .34482, 

34602. .34800)) 

/inf erence= "non-experimental evidence, no additional 
details recorded" 

/note= "unnamed protein product; gb | AAF26483 . 1 

gene_id:MPEll . 10 

similar to unknown protein" 

/codon_start=l 

/protein_id="BAB01056. 1" 

/db_xref="GI : 9279598" 

/translation= "MKNVMLIIDESNASYDLLIWALENQKDTIESSKVYIFAKQPQNS 
FTPPTVLSSSVGFAQIFYPFSPNSELIRLAQEKNMKIALGILEKAKKICLNHGIKAET 
FTDDGDPKDLIRKIIQERNINLIVTSDQQSLKKCTQNTDCSLLWKKRLRKD" 
CDS join(36033. .36150,36243. .36283,364 97. .3656 5,36651. .36707, 

36797. .36871) 



Query Match 100.0%; Score 1226; DB 4; Length 83650; 

Best Local Similarity 100.0%; Pred. No, 0; 

Matches 1226; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Oy 1 GATCTCTATGTATATAATWVTATGGGTAATATTAGAAACTAACTATGAAATGGAAAAGAA 60 

IIIIMIIMIIIIIMIIIIIIIIMIIIIIIIIIIIIIIMIMIMIIIIIIIIIM 

Db 13 06 9 GATCTCTATGTAt ATAAAAATATGGGTAATATTAGAAACTAACTATGAAATGGAAAAGAA 13 010 

Qy 61 TTGAGAGAATGACATTGTGTCAGAAAAGTTAGGTAAATAACATTTTCTGAAAAAGAGAAA 120 

IMIIMIIIMIIIIIIIIIMIMIIillNIIIMIMIMIMIIMMIIIMII 

Db 13 00 9 TTGAGAGAATGACATTGTGTCAGAAAAGTTAGGTAAATAACATTTTCTGAAAAAGAGAAA 12 950 

Qy 121 ATACAAAAATATCCTTGTGTTTACTTATTTTTACAATAATGCCATTGGCTTTAGTTATAA 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIINIII 

Db 12 94 9 ATACAAAAATATCCTTGTGTTTACTTATTTTTACAATAATGCCATTGGCTTTAGTTATAA 128 90 



Qy 181 AGTTTATATGTATTTGTCTAAAATAGCATGATATATTTACAAAAATCATGCAATTCTTTA 24 0 

IIIMIIIIMMIllMI'IIIIMMIIIIIMMIillMIMIMMIIIIMMII 

Db 12889 AGTTTATATGTATTTGTCTAAAATAGCATGATATATTTACAAAAATCATGCAATTCTTTA 12830 
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Qy 241 AAATACATACAGAATATATATACACGATATATATGTTTCTCTGAAATAATGTGTTTCTCA 3 00 

llllllllilllllllllllllllllllllllllMIIMIIIIIIIIIMIIIIIIIII 

Db 1282 9 AAATACATACAGAATATATATACACGATATATATGTTTCTCTGAAATAATGTGTTTCTCA 12770 

Qy 3 01 GAAATAGCACGAAATATTTATAAAAAGCATGCAATTCTCTTATAGATCGCGAAGTTTAAA 3 60 

llllllillllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

Db 1276 9 GAAATAGCACGAAATATTTATAAAAAGCATGCAATTCTCTTATAGATCGCGAAGTTTAAA 12710 

Oy 361 AAAACATATAGAATTGTTACAATATTACATGGGTTTTTATTGGATAACATGACAAATATT 4 20 

IIIIIIMIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

Db 1270 9 AAAACATATAGAATTGTTACAATATTACATGGGTTTTTATTGGATAACATGACAAATATT 12650 

Qy 4 21 TATTTATTTCATGAGTTTTTATTGGATAGCATGACAAATATTAATATATCAGTGTTAATA 4 80 

II I Ml liril 1 1 II I III IIMII II III II II MM II Ml Mill II MM II I III 

Db 1264 9 TATTTATTTCATGAGTTTTTATTGGATAGCATGACAAATATTAATATATCAGTGTTAATA 125 90 

Qy 4 81 ACATGTTTTGTTCTTAAAATACATGCATTTTAAAATCAGACATTTGTTTTAAAATCAAAT 54 0 

IIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 12 5'8 9 ACATGTTTTGTTCTTAAAATACATGCATTTTAAAATCAGACATTTGTTTTAAAATCAAAT 12 53 0 

Qy 541 CTAATCTCTTATATCACAACGACATTGACGGAAAATTCAGGTAAAAAGAGAAAATAAAGA 600 

IIIMMIMIMIMIMIMIIMMIMMMIMIMIIIIIIIIIMIIIIIIII 

Db 12529 CTAATGTCTTATATCACAACGACATTGACGGAAAATTCAGGTAAAAAGAGAAAATAAAGA 12470 

Qy 60-1 ATGAGAGATAGAGAGATTTCTATGGAAAAAGAAAGAGAGAACATGTAGGTGAACAAAATA 660 

MIIIIIIIMIIMIMIIMIIIIIIMIMIMIIIMIIIMIMIIMIMIIM 

Db 124 6 9 ATGAGAGATAGAGAGATTTCTATGGAAAAAGAAAG AGAGAACATGT AGGTG AACAAAAT A 12410 

Qy 661 AAGAGATATGATGATATATTTTATGAGAGGTGGTGAAGATTATTTTAGGAGAGGGAGAGA 720 

IIMIIIIIIIMIIIIIMMIIIMMIIIMMIIIMIMIIIIMIMIIIIMI 

Db .1 24 0 9 AAGAGATATGATGATATATTTTATGAGAGGTGGTGAAGATTATTTTAGGAGAGGGAGAGA 123 50 

Qy 721 GAAATAGAAAAAGAAAATGACATGGTGAATCTGAAGAAGATGAATTGTGTTAAAGATGAA 780 

llllllllllllllllllilllllllllllllllllillllllllliilliliiiiiiii 

Db 1234 9 GAAATAGAAAAAGAAAATGACATGGTGAATCTGAAGAAGATGAATTGTGTTAAAGATGAA 12290 

Qy 781 GAGAGAAAGAGAACTCCATGGCTAAAGTCTCGTAAAGAAGATGAAAAAGAAACAAAAGAA 84 0 

I Mill! Ill II II I Mill I II III II I IIIIIMII III III IIMII MUM Ml I 

Db 12289 GAGAGAAAGAGAACTCCATGGCTAAAGTCTCGTAAAGAAGATGAAAAAGAAACAAAAGAA 12230 

Oy 84 1 GGAAGAAGAAAGAGAAAGGCTAAAATAGACTAACTATTGCCAAAATTTCTGTAGCCGACA 900 

I II M M II M M I M II II II II II II II M II II II II II II II II II II II II II II 

Db 12 22 9 GGAAGAAGAAAGAGAAAGGCTAAAATAGACTAACTATTGCCAAAATTTCTGTAGCCGACA 12170 

Qy 901 AATACTATTTGGTCCAAGGTTATTTTGTGTATTCTTTTGAAGTCAAAAGTTATTTCTTAC 960 

MM MM M MM IIMII IM IIMII IIMIIIIIIIIMMIIIIIIIIIMIIII 

\yo 1216 9 AATACTATTTGGTCCAAGGTTATTTTGTGTATTCTTTTGAAGTCAAAAGTTATTTCTTAC 12110 

Qy 961 ATATACTCTAAAAATATAGCCGATACCAATTTTTCCACACATGGACTTCCTTTATTCCAA 102 0 

II I M M I M M M M M M M M M M M M M II M M M M M M M M M M II II 

Db 1210 9 ATATACTCTAAAAATATAGCCGATACCAATTTTTCCACACATGGACTTCCTTTATTCCAA 12050 

Qy 1021 AAGTCAATAAAGTGTGACGTCATGATACTTACGCTTTAAAACATCGCATGATGATGTCAT 108 0 

MMMII IIIMIIMIIIIMIMIIMIIIIIMIMIIIIMIIMIIIMMMI 

Db 12049 AAGTCAATAAAGTGTGACGTCATGATACTTACGCTTTAAAACATCGCATGATGATGTCAT 11990 

Qy 1081 TAGCATCAATCTCCACCGTCCAATTTATTTAGTTGTTGACAATATCGACCGTCTAAGTTC 114 0 
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Db 11989 TAGCATCAATCTCCACCGTCCAATTTATTTAGTTGTTGACAATATCGACCGTCTAAGTTC 11930 

Qy 1141 CACACCGACGGCTATAAGAGTTTCATTATAAATTTTAGCAAAATAAAATCAGCAAATAAT 1200 

IMIIIIIIIIIIIMIMIIIIIIIIIIIMIMIMIMIMIIMIIIIIMIIIII 

Db 11929 CACACCGACGGCTATAAGAGTTTCATTATAAATTTTAGCAAAATAAAATCAGCAAATAAT 11870 

Qy 1201 TTTTTCTTGACTAAGCTTAAACGACG 1226 

lllllllllllltllllllllMIII 
Db 11869 TTTTTCTTGACTAAGCTTAAACGACG 11844 



C 
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APPENDIX B 

SCORE Search Results Details for Application 10566201 and Search Result us-10- 566-201- 
2.rnpbm. 

Score Home Page Retrieve Application List SCORE System Overview SCORE FAQ Comments 
/ Suggestions 



This page gives you Search Results detail for the Application 10566201 and Search 
Result us-10-566-201-2 . rnpbm. 



Go Back i:o previous page 



GenCore version 5.1,9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: 



Title: 

Perfect score: 
Sequence : 



September 28, 2006, 02:25:12 ; Search time 1565 Seconds 

{without alignments) 
9625,969 Million cell updates/sec 

US-10-566-201-2 
1226 

1 gatctctatgtatataaaaa ttgactaagcttaaacgacg 1226 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



18892170 seqs, 6143817638 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



37784340 



Da tabase 



Publ i shed_Appl i ca t i ons J 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 



/EMC_Celerra_SIDS3 
/ EMC_Ce le r ra_S IDS 3 
/ EMC_Ce 1 e r ra_S I DS 3 
/EMC_Celerra_SIDS3 
/ EMC_Ce 1 e r r a_S I DS 3 
/ EMC_Ce 1 e r r a_S I DS 3 
/ EMC_Ce 1 e r r a_S I DS 3 
/ EMC_Ce 1 e r r a_S I DS 3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS 
/EMC_Celerra_SIDS 
/ EMC_Ce 1 e r r a_S I DS 



s_NA_Main:* 
/ptodata/2/pubpna/US0 
/ptodata/2/pubpna/US0 
/ptodata/2/pubpna/US0 
/ptodata/2/pubpna/US0 
/ptodata/2/pubpna/US0 
/p t oda t a / 2 /pubpna /US 1 
/ptodata/2/pubpna/USl 
/ptodata/2/pubpna/USl 
/ptodata/2/pubpna/USl 
3/ptodata/2/pubpna/US 
3 /ptodata/ 2 /pubpna /US 
3/ptodata/2/pubpna/US 



7_PUBC0MB. seq: * 
8_PUBC0MB.seq: * 
9A_PUBC0MB , seq : * 
9B_PUBC0MB. seq: * 
9C_PUBCOMB. seq: * 
0A_PUBCOMB.seq:* 
0B_PUBCOMB.seq:* 
0C_PUBCOMB, seq: * 
0D_PUBCOMB. seq: * 
10E_PUBCOMB. seq: * 
10F_PUBCOMB . seq : * 
10G_PUBCOMB.seq: ^ 
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13 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllA_PUBCOMB. seq: * 

14 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllB_PUBCOMB.seq: * 

15 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllC_PUBCOMB. seq: * 

16 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllD_PUBCOMB. seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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Db 301 GAAAATAAAGAATGAGAGATAGAGAGATTTCTATGGAAAAAGAAAGAGAGAACATGTAGG 360 

Qy 650 TG AACAAAATAAAGAGATATGATGATATATTTTATGAGAGGTGGTG AAGATTATTTTAGG 709 

iiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiriiiiiiiiiiii-iiiiiiiiiiiiiii 

Db 361 TGAACAAAATAAAGAGATATGATGATATATTTTATGAGAGGTGGTGAAGATTATTTTAGG 4 20 

Qy 710 AGAGGGAGAGAGAAATAGAAAAAGAAAATGACATGGTGAATCTGAAGAAGATGAATTGTG 769 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

Db '121 AGAGGGAGAGAGAAATAGAAA/^AGAAAATGACATGGTGAATCTGAAGAAGATGAATTGTG 4 80 

Qy 7 7 0 TTAAAGATGAAGAGAGAAAGAGAACTCCATGGCTAAAGTCTCGTAAAGAAGATGAAAAAG 82 9 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIillllilMMIIIIIIIIIII ' 

Db .4 81 TTAAAGATGAAGAGAGAAAGAGAACTCCATGGCTAAAGTCTCGTAAAGAAGATGAAAAAG 54 0 

Qy 830 AAACAAAAGAAGGAAGAAGAAAGAGAAAGGCTAAAATAGACTAACTATTGCCAAAATTTC 88 9 

IIIMIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIMIIIIMIIIIIIIIMIIIII 

Db 541 AAACAAAAGAAGG;U\GAAGAAAGAGAAAGGCTA7\AATAGACTAACTATTGCCAAAATTTC 600 

Qy 8 90 TGTAGCCGACAAATACTATTTGGTCCAAGGTTATTTTGTGTATTCTTTTGAAGTCAAAAG 94 9 

IIMIMIMIIIMIMIIIIIIMMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIM 

Db 601 TGTAGCCGACAAATACTATTTGGTCCAAGGTTATTTTGTGTATTCTTTTGAAGTCAAAAG 660 

Qy 950 TTATTTCTTACATATACTCTAAAAATATAGCCGATACCAATTTTTCCACACATGGACTTC 100 9 

II I MM 1 1 II III I II MM II I II II 1 1 II II III III II I II MINI 1 1 II II II I 

Db 661 TTATTTCTTACATATACTCTAAAAATATAGCCGATACCAATTTTTCCACACATGGACTTC 720 

Qy 1010 CTTTATTCCAAAAGTCAATAAAGTGTGACGTCATGATACTTACGCTTTAAAACATCGCAT 106 9 

II II II II 1 1 M I M ri M I II I II I II M II II I II I II II 1 1 II II II I II M II II I 

Db 721 CTTTATTCCAAAAGTCAATAAAGTGTGACGTCATGATACTTACGCTTTAAAACATCGCAT 78 0 

Qy 107 0 GATGATGTCATTAGCATCAATCTCCACCGTCCAATTTATTTAGTTGTTGACAATATCGAC 112 9 

IIIIMIIIIMIIIIMIIIIMIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

Db 781 GATGATGTCATTAGCATCAATCTCCACCGTCCAATTTATTTAGTTGTTGACAATATCGAC 84 0 

Qy 113 0 CGTCTAAGTTCCACACCGACGGCTATAAGAGTTTCATTATAAATTTTAGCAAAATAAAAT 1189 

MIIIIIIIMIIMIMIIIIMIMIIIIIIIIIIMIIIMMIMMIIIMMM 

Db 84 1 CGTCTAAGTTCCACACCGACGGCTATAAGAGTTTCATTATAAATTTTAGCAAAATAAAAT 900 

Qy 1190 CAGCAAATAATTTTTTCTTGACTAAGCTTAAACGACG 1226 

IIIIIIMMIIIIIMIIIIIIIIIMMMMIII 

Db 901 CAGCAAATAATTTTTTCTTGACTAAGCTTAAACGACG 93 7 
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44 79.6 6.5 2512 4 US - 09 - 925 - 06 5A- 271 91 Sequence 27191, A 

45 79.6 6.5 2512 4 US- 09- 925- 065A-27192 • Sequence 27192, A 
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RESULT 1 

US-10-800-161-24 

Sequence 24, Application' US/10800161 
Publication No. US20040154051A1 
GENERAL INFORMATION: 
APPLICANT: Cade, Rebecca M 
APPLICANT: Dietrich, Robert A 
APPLICANT: Lawton, Kay Ann 
TITLE OF INVENTION: INDUCIBLE PROMOTERS 
FILE REFERENCE: A-31089CIP1- 

CURRENT APPLICATION NUMBER: US/ 10/800 , 16 1 
CURRENT FILING DATE: 2004-03-12 
PRIOR APPLICATION NUMBER: 60/171,008 
PRIOR FILING DATE: 1999-12-15 
PRIOR APPLICATION NUMBER: 60/175,519 
PRIOR FILING DATE: 2000-01-11 
NUMBER OF SEQ ID NOS : 31 
SOFTWARE: Patent In Ver. 2.i 
SEQ ID. NO 24 
LENGTH: 9.62 
TYPE: DNA 

ORGANISM; Arabidopsis thaliana 
US-10-800-161-24 

Query Match 76.4%; Score 937; DB 8; Length 962; 

Best Local Similarity 100.0%; Pred. No. 1.6e-149; 

Matches 937; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Oy 290 TGTGTTTCTCAGAAATAGCACGAAATATTTATAAAAAGCATGCAATTCTCTTATAGATCG 34 9 

11 II II M III Mil Mil Mill Ml M I MM III II Mill II I lllll II MM I I 

Db . 1 TGTGTTTCTCAGAAATAGCACGAAATATTTATAAAAAGCATGCAATTCTCTTATAGATCG 60 

Qy 3 50 CGAAGTTTAAAAAAACATATAGAATTGTTACAATATTACATGGGTTTTTATTGGATAACA 4 09 

II MM II II Mill II MM III II Ml I II lllll lllll ill! MM I II MM II I 

Db 61 CGAAGTTTAAAAAAACATATAGAATTGTTACAATATTACATGGGTTTTTATTGGATAACA 120 

Qy 410 TGACAAATATTTATTTATTTCATGAGTTTTTATTGGATAGCATGACAAATATTAATATAT 469 

IMIIMIMIIIIMIIIIIIIIIIIIIMMIIIIIMMIMMIIIIIIMMMI 

Db 121 TGACAAATATTTATTTATTTCATGAGTTTTTATTGGATAGCATGACAAATATTAATATAT 180 

Qy 4 70 CAGTGTTAATAACATGTTTTGTTCTTAAAATACATGCATTTTAAAATCAGACATTTGTTT 52 9 

II IMIMIMIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIMIIMIilMMIM 

Db 181 CAGTGTTAATAACATGTTTTGTTCTTAAAATACATGCATTTTAAAATCAGACATTTGTTT 24 0 

Qy -53 0 TAAAATCAAATCTAATCTCTTATATCACAACGACATTGACGGAAAATTCAGGTAAAAAGA 58 9 

MIMIMIMMMMMIMMMIMMMMMMMIMMMMMMMIMI 

Db 241 TAAAATCAAATCTAATCTCTTATATCACAACGACATTGACGGAAAATTCAGGTAAAAAGA 30 0 



5 90 GAAAATAAAGAATGAGAGATAGAGAGATTTCTATGGAAAAAGAAAGAGAGAACATGTAGG 64 9 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMMIIIIIIIIIIIIIIIMIIIIII 



